The Structural Topic Model and Applied Social Science∗

نویسندگان

  • Margaret E. Roberts
  • Brandon M. Stewart
  • Dustin Tingley
  • Edoardo M. Airoldi
چکیده

We develop the Structural Topic Model which provides a general way to incorporate corpus structure or document metadata into the standard topic model. Document-level covariates enter the model through a simple generalized linear model framework in the prior distributions controlling either topical prevalence or topical content. We demonstrate the model’s use in two applied problems: the analysis of open-ended responses in a survey experiment about immigration policy, and understanding differing media coverage of China’s rise. 1 Topic Models and Social Science Over the last decade probabilistic topic models, such as Latent Dirichlet Allocation (LDA), have become a common tool for understanding large text corpora [1].1 Although originally developed for descriptive and exploratory purposes, social scientists are increasingly seeing the value of topic models as a tool for measurement of latent linguistic, political and psychological variables [2]. The defining element of this work is the presence of additional document-level information (e.g. author, partisan affiliation, date) on which variation in either topical prevalence or topical content is of theoretic interest.2 As a practical matter, this generally involves running an off-the-shelf implementation of LDA and then performing a post-hoc evaluation of variation with a covariate of interest. A better alternative to post-hoc comparisons is to build the additional information about the structure of the corpus into the model itself by altering the prior distributions to partially pool information amongst similar documents. Numerous special cases of this framework have been developed for particular types of corpus structure affecting both topic prevalence (e.g. time [3], author [4]) and topical content (e.g. ideology [5], geography [6]). Applied users have been slow to adopt these models because it is often difficult to find a model that exactly fits their specific corpus. We develop the Structural Topic Model (STM) which accommodates corpus structure through document-level covariates affecting topical prevalence and/or topical content. The central idea is to ∗Prepared for the NIPS 2013 Workshop on Topic Models: Computation, Application, and Evaluation. A forthcoming R package implements the methods described here. † These authors contributed equally. We assume a general familiarity with LDA throughout (see [1] for a review) By “topical prevalence” we mean the proportion of document devoted to a given topic. By “topical content” we mean the rate of word use within a given topic.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Role of Internet Dependency on Online Social Capital among Graduate Students in University of Putra Malaysia

This study examined to study how respondents rely on the Internet to fulfill the various life goals dimensioned into understanding, orientation and playing goals, and how this dependency relates to the generation of social capital. Further, it examined the resources of social capital in terms of bonding social capita and bridging social capital. In this study quantitative research approach was ...

متن کامل

Patterns of Physical Activity and Its Impact on the Quality of Life: A Structural Equation Modeling Analysis

Background. In many countries, including Indonesia, the tendency for non-communicable diseases is increasing. Consequently, health costs must be paid by the state and continue to increase. People's lifestyles, including lack of physical activity, are thought to have contributed significantly to the problem. Objectives. This study aims to examine the impact of physical activity on quality of ...

متن کامل

A Structural Model of the Relationship Between Teachers\' Job Motivation and Socio-Emotional Skills and Social Capital Mediated by Self-Efficacy

The purpose of this study was to consider the goodness of fit of a structural model of the relationship between teachers' job motivation and their social capital and social-emotional skills with the mediation of self-efficacy. The research method was descriptive-correlational. The statistical population included female teachers working in the 10th and 5th districts of Tehran in the second secon...

متن کامل

Presentation of Economic Regeneration Model in Historic Fabric Based on Order in Structural Functionalism Theory

Historic fabric can perform an important role in the development of cities. Urban sustainable regeneration is one of the recent approaches in historic fabric. In this approach, all indicator of sustainable development including economic, social, cultural, management and environmental dimensions have been used in conservation of the historic fabric. All the principles of sustainable development ...

متن کامل

Providing a structural model for psychological problems based on disconnection and rejection domain and negative automatic thoughts with mediating role of experimental avoidance

Introduction: Psychological problems are the result of a person's interaction with the environment and include behaviors that cause social conflicts, dissatisfaction and individual unhappiness. The present study aimed to provide a structural model for psychological problems based on disconnection and rejection domain and negative automatic thoughts with mediating role of experimental avoidance....

متن کامل

Developmental and Structural Analysis of the Social and Cultural Changes and Its Effects on Local House in Turkmen, Gomishan and Gorgan

The Clarification of the field-oriented thought in Iranian contemporary architecture through the sociological knowledge model can be studied and analyzed as a comparative model as well as the location of the phenomena of Iranian contemporary society, especially architecture. Although sociology of knowledge must be able to control the cause of social and historical deviation in the context of Ir...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013